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Abstract 

A word-graph is a digraph represented by a word to such that 
the vertex-set V{GJ) is the alphabet of u and the edge-set E(G U ) is 
determined by non-identical adjacent letter pairs in oj. In this paper 
we study the strong-connectivity of word-graphs. Our main result is 
that the number of strongly connected word-graphs represented by 
t— words of over an n— alphabet can be expressed via the recurrence 
relation T(£, n) on the Stirling numbers of the second kind using a 
link between word partitions and digraph connectivity. 

T ^n) = \ +EE ;F' i ' 1,n ' m)(B ' m " 1) 



1 Introduction 

A digraph D is representable if D can be mapped to a word in some manner. 
In this paper we study a particular form of edge-set encoding for word- 
representable digraphs that has close links with discrete stochastic processes. 



1 



Definition 1. A word-graph G u is a simple digraph associated with an 
£— word u> over an n— alphabet for i > n > such that the vertex set of 
Gu, V(G W ), is the alphabet of u and the directed edge-set of G u , E(GJ), is 
encoded from u using the set of non-identical adjacent letter pairs: 

V{G U ) = A(u) 

E{G W ) = E{u) = {(u) h u 2 ), (u 2 , u 3 ),..., {ui-x, u e )} 
such that for G E(G W ) it holds that Ui ^ 

The edge-set encoding in Definition [Tj is of particular note due to its close 
links with a range of stochastic word-based processes such as language, music, 
DNA and protein sequences. This form of edge-set encoding is also closely re- 
lated to the concept of digraph path decomposition Other forms of edge- 
set encoding have been used in the literature for both word-represent able 
graphs 0, 0] and word-representable digraphs [5|. These existing encodings 
are used for very specific applications (such as semigroup or DNA sequence 
analysis) whilst word-graphs have a wide range of general applications. 

The focus of this paper is on determining the criteria required for a 
word to represent a strongly-connected digraph and how many words, out 
of all I— words over an n— alphabet, represent strongly-connected digraphs. 
Strong-connectivity is an vital concept for word-graph theory because ev- 
ery strongly-connected word-graph, or component, can be represented by a 
single wordjij]. 

Definition 2. A digraph D is strongly-connected if there exists a path from 
every v% G V(D) to every Vj G V(D). 

This paper is organised into four sections. The remainder of the first 
section details previous results and the notation used in this paper. The 
second section describes our results and the third section contains proofs of 
these results. Future work is detailed in the final section. 

1.1 Previous results 

We say a digraph D is represent able, i.e. is a word-graph, if there exists 
a word u that encodes D in the manner specified by Definition [TJ Theo- 
rem [TJ provides the criterion to be satisfied for a digraph to be representable 
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although there are other ways to characterise representability such as forbid- 
den substructures or condensation forms^. 

Theorem 1. A digraph D is a word-graph (representable) if there exists 
a directed path P from Vi G V(D) to Vj G V(D) such that P transverses 
each edge of D at least once. The sequence of vertices visited by P is a 
representational word of D. 

Corollary 1. Representable digraphs are weakly-connected. 

Corollary 2. Strongly-connected digraphs are representable. 

In the same manner that digraphs are characterised by vertex and edge 
set cardinality, word-graphs are characterised by word length and alphabet 
cardinality. The number of word-graphs represented by words of length I 
over an alphabet of cardinality n can be expressed using Stirling numbers of 
the second kind[2(. 

Theorem 2. The set of I— words over an n— alphabet is of cardinality ■ 

The term {^} is a Stirling number of the second kind, or the number of 
ways of partitioning a set of cardinality £ into n subsets. If p is an n— partition 
of £ then the word corresponding to p would be: uj = PYP2 ■ ■ -p n , for some 
ordering (i.e. lexicographical) of the partition members. The nl term in 
Theorem [2] accounts for the permutation of the partitions to reflect the order 
in which they can occur in a word. We refer to the set of £— words over an 
n— alphabet as a word-graph family 2U(£, n). By Corollary [2j every strongly- 
connected digraph is represented by a word in some word-graph family. 

Finally we define Menger's Theorem^ which is used in the proofs in 
Section [3J 

Theorem 3. The edge-connectivity of a digraph is equal to the minimum 
number of edge-disjoint paths between any vertex pair. 

1.2 Notation and terminology 

An £— word uj over an n— alphabet is an ordered sequence of symbols of length 
£ — lo — oj\ . . .uj£ such that Ui G A(oS) for |A(u;)| = n where A(uS) is the 
alphabet of uj. A circular word is one for which oj\ = ue and a trivial word 
is one for which |A(u;)| = 1. 
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A factor of a word u is any ojj . . . contiguous subword. A partition, 
or a factorisation, p of a word u is a k— tuple for 1 < k < I of factors 
such that lo = p\P2 ■ ■ - Pk- A disjoint partition of a; is a partition such that, 
for each pi ^ pj pair of partitions, the alphabets of pi and pj are disjoint: 
A(pi) C\A(pj) = 0. A minimal disjoint partition of u is a partition of u with 
the smallest cardinality such that the partitions remain disjoint. 

The edge-connectivity of a word-graph is A(G aJ ) where X(G UI ) is the car- 
dinality of the edge-cut set S C E(G UJ ) such that S is the smallest set of 
edges that disconnects when removed. 

2 Results 

The initial results in this paper take the form of three lemmas concerning 
the structure of words that represent strongly-connected digraphs. The three 
lemmas build upon one another. The final lemma is used to prove a theorem 
on the number of words that represent strongly-connected word-graphs in 
a word-graph family. The three lemmas are based upon the link between 
word-graph strong-connectivity and edge-connectivity. 

Lemma 1. Iff X(G 0J ) > 1 then G^ is strongly-connected. 

A bridge is an unidirectional edge in G^ that when removed disconnects 
G u . When such a bridge exists in G u we say G u is 1-edge-connected or 
that \{GJ) = 1. Lemma Q] proves that word-graphs with bridges are the 
only word-graph which are not strongly-connected. This edge-connectivity 
property holds due to the way in which edges are encoded from the represen- 
tational word (Definition [1]). Using Lemma [1] we prove Lemma |2] which char- 
acterises the structural properties of words that represent strongly-connected 
digraphs. 

Lemma 2. Iff \{G U ) > 1 then u cannot be partitioned into disjoint factors. 

See Subsection 11.21 for a definition of disjoint factors. Lemma [2] is re- 
stated by Lemma [3] in a manner that facilitates the enumeration of strongly- 
connected word-graphs. 

Lemma 3. The number of £— words over an n— alphabet, with a unique fac- 
torisation, that represent word-graphs with X(G W ) > 1 is the number of 
n— partitions of an I— set such that no proper subset of parts has a union 
equal to the subset {1,2, ... ,j} for j < L 
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Via the three structural lemmas, the main enumerative result is proved 
in Theorem H] using a recurrence relation on Stirling numbers of the second 
kind. 

Theorem 4. There exist tp(£,n) strongly-connected word-graph members of 
the family 20(£, n) where: 

<p{l,n) = n\T(e,n) 

T ^n) = \ [ + EEU \W-j-l,n-m)(n-m-l) 

< > j=0 m=0 ^ ' 

The recurrence in Theorem H] was originally derived by Zabrockij^ for an 
algebraic problem but, as yet, no formal proof has been published. 

3 Proofs of results 

In this section we use the previous results from Subsection 11.11 to prove the 
theorems and lemmas introduced in Section [2j We start with a charac- 
terisation of the structure of strongly-connected word-graphs using edge- 
connectivity. 

Lemma 1. Iff X(G 0J ) > 1 then G^ is strongly-connected. 

Proof. A strongly-connected digraph D, by Definition |2j has two edge- 
disjoint paths between every vertex pair thus, by Menger's Theorem[6|], D 
is always 2-edge-connected. The remainder of this proof is dedicated to 
showing that if \(G U ) > 1 then G^ is strongly-connected. 

Let K n be the complete digraph of order n such that its vertices are 
labelled with the first n integers. Let w be a walk on K n and let G^ be the 
word-graph represented by u. We take bidirectional edges in G w to be pairs 
of unidirectional edges with opposing orientations. If the walk u resides in 
vertex Vj G V(K n ) then the walk can take one of two actions: 

i. transition from Vj to Vj + k for j + k < n 

ii. transition from Vj to for j — k > 1 

Assume, without loss of generality, that u starts on v% and, for the first 
n — 1 steps, uses action i so that G^ is weakly-connected (Corollary [T]) and, 
by Menger's Theorem^], is 1-edge-connected (Figured]). 
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Figure 1: G w after n — 1 steps on K n 

To ensure G^ is 2-edge-connected, the walk u must continue. The walk 
cannot use action i because it already resides in v n therefore it must use 
action ii to make the transition from v n to v n -j (Figure |2]). 




Figure 2: after n steps on K n 

G u now contains the strongly-connected component C = {v n , . . . , i> n -i}- 
By Menger's Theorem^, C is 2-edge-connected. To ensure the entirety of 
G u is 2-edge-connected, u must again make the transition outside C so that 
C becomes larger still. If C does not encompass all nodes in K n then G^ 
cannot be 2-edge-connected because G^ contains a bridge. Therefore the 
walk u must continue in this manner until the strong-component C encom- 
passed all vertices so that G w is 2-edge-connected and strongly-connected 
(Figure EJ. 




Figure 3: G w after n + k steps on K n 

Thus we have shown that 2-edge-connectivity is a necessary and sufficient 
condition for the strong-connectivity of word-graphs. □ 

Next we show that the structure of words that represent strongly-connected 
digraphs is characterised by the digraph's bridgelessness (2-edge-connectedness). 

Lemma 2. Iff \{GJ) > 1 then u cannot be partitioned into disjoint factors. 

Proof. Let p be a A;— partition of the £— word u. If p is a disjoint partition 
then 2 < k < n. If k > n then, by the pigeon-hole principle, there must 
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be some pair of non-disjoint partitions because | A(o^) | = n. If k = 1 then 
Pi = u and thus cannot be disjoint. 

Assume p is a disjoint k— partition of u for 2 < k < n. For any pi,pi + i 
partition pair, contiguous in u, of length u and v respectively, there exists an 
edge between the vertices pi iU and Pi+i, v in G w . Given that all pi partitions 
are disjoint, vertex p ijU occurs in no other partition but p^ and vertex Pi+i jV 
occurs in no other partition but pi + \. Therefore there exists only a single 
unidirectional edge between pt >u and Pi+i v 

(Figure HJ). 



\ o 

Figure 4: A word-graph represented by a contiguous disjoint partition 

Furthermore, because every pi_j,p i+ i +m partition pair is disjoint, there 
can be no path between p i)U and Pi+i, v except the {p%,u,Pi+i,v} edge. If that 
edge were to be removed then would become disconnected. As such, if uj 
can be partitioned into k > 1 disjoint factors then G w contains a bridge and 
A(G IlJ ) = 1. If u> cannot be partitioned into disjoint factors then G^ cannot 
contain a bridge and X(G U ) > 1. □ 

Corollary 3. If a word u can be minimally partitioned into k disjoint factors 
then G^ contains k — 1 bridges. 

Lemmas [T] and [2] show that words that represent strongly-connected di- 
graphs cannot be partitioned into disjoint factors. Lemma |3] restates these 
results in a manner that facilitates the enumeration of strongly-connected 
word-graphs. 

In Theorem |2l the n! term of ^!{^} accounts for factor permutation. In 
Lemma [3] the n\ term will be temporary ignored to make the proof simpler 
such that we only deal with the set of i— words over an n— alphabet with 
a unique factorisation, i.e. we are not concerned with the order of those 
factors. 

Lemma 3. The number of I— words over an n— alphabet, with a unique fac- 
torisation, that represent word-graphs with X(G W ) > 1 is the number of 
n— partitions of an I— set such that no proper subset of parts has a union 
equal to the subset {1,2, ... ,j} for j < L 
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Proof. Let p be an n— partition of an £— set with members equal to the first £ 
non-zero integers. Let each pi be mapped to an alphabet member aj G A(u) 
of some word uj. Let the j—th integer member of pi be the index of letter 
<2j G A(u) in u;. This construction gives an £— word over an n— alphabet: 

= h(pi) = ai (3.1) 

The number of n— partitions of an £-set is {^}, which is the same as the 
number of £— words over an n— alphabet with a unique factorisation. 

If a proper < k < £ subset of p has a union equal to the subset 
{1, 2, . . . ,j} for j < £ then p is deemed reducible. A word constructed from 
a reducible partition is called partition-reducible. For a partition-reducible 
£— word, the factor uj\ . . . ujj is over the alphabet {ai, . . . , a*,} and the factor 
Wj+i ■ ■ ■ uJe is over the alphabet {a^+i, . . . , a n }. Therefore there exists a dis- 
joint partition of u between Uj and Uj + ±. Thus partition-reducible words 
cannot represent strongly-connected word-graphs because the word-graphs 
are 1-edge-connected (Lemmas [TJ and [2]). 

If a partition is not reducible, it is irreducible. Let w be a partition- 
irreducible word formed from the partition p as described by Equation 13.11 
Assume 1 G p\ and because p is irreducible, the union of p\ with any proper 
subset of p — {pi} cannot be equal to {1, . . . , j}. Therefore there exists no 
disjoint partition of uj. Thus partition-irreducible words always represent 
strongly-connected word-graphs because the word-graphs are at least 2- 
edge-connected (Lemmas [TJ and [2]). □ 

Via Lemma [31 a restatement of Lemma [21 we can count the number of 
strongly-connected word-graphs in 2U(£, n). 

Theorem 4. There exist ip(£,n) strongly-connected word-graph members of 
the family 2U(£, n) where: 

<p(£,n) = n\T(£,n) 

T ^n) = \ U££JUT(*-j-l,n-ro)(n-ro-l) 

Proof. There are two ways to construct an irreducible i— set n— partition. 
First, let p be a partition of {1, 2, ...,£ — 1} into n subsets. Assume that 
1 G pi and add £ to p\. If p were a reducible partition then the union of, at 
maximum, n — 1 subsets of p would form the set {1, ...,£}. Besides 1 and 
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£ there exist £ — 2 members of the £— set distributed amongst the subsets 
of p. No subset can be empty thus in order to form the union {1, . . .,£}, 
Pi and all remaining i — 2 elements must be in the union. Such a partition 
cannot be reducible because the union is not formed from a proper subset of 
the partition. Thus partitions constructed in such a manner are irreducible. 
There exist: 



n 



(3.2) 



partitions of this type. 

For the second type, let p be an irreducible partition of £ — j — 1 into 
n — m and let p be a partition of j into m. Let pp be the union of the 
two partitions. The previous irreducible partition type counted all partitions 
with £ G pi such that 1 G p%. Putting £ & pi makes pp reducible because 
[Jpi = {1, 2, . . . , £ — j — 1}. The only remaining option is to put £ into pi G p 
such that 1 ^ pi, which ensures pp is irreducible because: 

\Jpi = {l,2,...,£-j-l,£} 

[jppi = {!,...,£} (3.3) 

Such a partition is irreducible because the only possible union equal to 
{1, . . . , j} is not formed from a proper subset of pp (Equation 13. 3p . 

For some fixed j and m, there exist T(£ — j — l,n — m) irreducible 
{1, ...,£ — j — 1} partitions into n — m; {^} partitions of {£ — j, . . . ,£ — 1} 
into m; and n — m — 1 ways of placing element £ into p; t E p such that p 
remains irreducible: 



T(£ — j — 1, n — m){n — m — 1) (3-4) 



To count all such partitions, equation 13.41 has to be summed over all 
< m < n — 2 because n — m — 1 < 1 for m > n — 2 and over all < j < £ — 2 
because £ — j — 1 < 1 for j > £ — 2. 

These two types of irreducible partition constructions must be disjoint 
because, for the first type, £ and 1 are always in the same subset whilst, 
for the second type, £ and 1 are never in the same subset. Likewise, all 
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irreducible partitions are of one of these two types. If £ and 1 are in the 
same subset of p then p is irreducible and must be of the first type because 
Equation 13.21 counts all such partitions. If 1 e p\ and £ e pi for i > 1 and p 
is irreducible then p is a partition of type two. If p were not of type two then 
£ must be in its own subset or in a subset with members of {j + 1, • • ■ , £ — 1} 
thus the union of a subset of {pi, . . . ,Pi-i} would be equal to {1, . . . , j} and 
so p would be reducible. 

Combining Equations 13.21 and 13.41 gives the required recurrence relation. 

T ^n) = \ L+££JUT(*-j-l,n-ro)(n-ro-l) 

j=0 m=0 ^ ' 

The base cases for T(£,n) are: 

T(£, to) = for £ < and £ < n (3.5) 
T(£,l) = l (3.6) 

The base cases in Equation 13.51 come from Definition [TJ The base case in 
Equation 13.61 comes from the fact that the trivial i— word uj with |A(u;)| = 1 
represents the singleton digraph which is strongly-connected. Including the 
to! term, to account for factor permutation (see Theorem[2]), gives the function 
ip(£,n) = n\T(£,n) which is the number of strongly-connected word-graphs 
in 2H(£, to). □ 



4 Future work 

We have shown how many members of a word-graph family, defined by a 
particular edge-set encoding with links to stochastic processes, are strongly- 
connected. Similar reasoning could be used for the number of word-graph 
with k strong-components. 

Conjecture 1. The number of £— words over an n— alphabet, with a unique 
factorisation, that represent word-graphs with k strong-components is the 
number of n— partitions of an £—set with k disjoint proper subsets of parts 
which have a union equal to the subset {j, j + 1, . . . , j + m}. Each such union 
is an alphabet-disjoint partition in the corresponding representational word 
and is a strong-component in the corresponding word-graph. 
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The word-graph family 2B(£, n) includes isomorphic digraphs. The set of 
£— words over an n— alphabet that represent non-isomorphic words-graphs 
can be thought of as a fundamental word-graph family. A standing question 
is: what is the cardinality of an arbitrary fundamental word-graph family 
and for what £, when n is fixed, is the cardinality maximised? These topics 
are related to the concept of minimal representational word length j^]. 
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